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FIELD OF THE INVENTION 

5 

This invention concerns artificial promoter libraries and 
a method of constructing an artificial promoter library 
for a selected organism or group of organisms . The inven- 
tion also concerns the individual novel promoters derived 

10 from such libraries. Further, the invention concerns a 
method of optimizing the expression of a gene in a se- 
lected organism by use of promoters from such an artifi- 
cial promoter library for that organism. In principle , 
artificial promoter libraries according to the invention 

15 can be constructed for use in any living organism, but 
presently they have mostly been of value for modulating 
gene expression of microorganisms. In connection with 
this invention the term "microorganism" shall be taken 
broadly to include prokaryotic organisms such as bacteria 

20 as well as eukaryotic microorganisms such as yeasts, 
other fungi and cell lines of higher organisms . 

BACKGROUND OF THE INVENTION 

25 Metabolic engineering of living organisms is still in its 
infancy with respect to industrial applications, de- 
spite the fact that genetic engineering has now been fea- 
sible for more than a decade. To a large extent, this may 
be due to the disappointing outcome of many of the at- 

30 tempts so far to improve strain performance. There are at 
least two reasons for the negative outcome of the at- 
tempts to increase metabolic fluxes: 

One is that the genetic engineer tends to overlook the 
35 subtlety of control and regulation of cellular metabo- 
lism. The expression of enzymes that are expected to be 
rate limiting are increased 10 to 100 fold, e.g. by 
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placing the gene on a high copy number plasmid. Or, a 
branching flux in a pathway is eliminated by deleting a 
gene. Quite often, this will have secondary effects on 
the metabolism, for instance by lowering metabolite con- 
5 centrations that are essential to other parts of the cel- 
lular metabolism (e.g. processes that are essential to 
the growth of the organsim) and the net result may be 
that the overall performance of the cell with respect to 
the desired product is decreased. Instead, it is neces- 
10 sary to tune the expression of the relevant gene around 
the normal expression level and determine the optimal ex- 
pression level, for instance as the level that maximizes 
or minimizes the flux. 

15 The second reason for the negative outcome lies in the 
rate limiting concept itself: both metabolic control 
theory (Kacser and Burns, 1973) and experimental determi- 
nations of control by individual steps in a pathway 
(Schaaff et al . , 1989; Jensen et al . , 1993) have shown 

20 that reaction steps which were expected to be rate lim- 
iting with respect to a particular flux, turned out to 
have no or very little control over the flux. Instead, 
the control and regulation of the cellular metabolism 
turned out to be distributed over several enzymes in a 

25 pathway, and it may be necessary to enhance the expres- 
sion of several enzymes in order to obtain a higher flux. 

According to metabolic control theory, the total flux 
control exerted by all the enzymes in a pathway, should 
30 always sum up to 1 . Therefore, after one enzyme concen- 
tration has been optimized, the flux control will have 
shifted to another enzyme(s), and it may then be useful 
to perform additional rounds of enzyme optimization in 
order to increase the flux further. 



In summary, flux optimization requires 1) fine-tuning of 
enzyme concentration rather than many fold overexpression 
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and often 2) optimization of the level of several enzymes 
in a pathway rather than looking for the rate limiting 
step . 

5 There are now many systems available that allow one to 
increase the gene expression more than 1000 fold and/or 
to turn on gene expression at a particular time point 
during a fermentation process (e.g. using temperature in- 
ducible systems). With respect to tuning the steady state 

10 gene expression in the fermenter, to say 150% or 70% of 
the normal expression level, it becomes more difficult. 
In principle, one could use a lac- type promoter in front 
of the gene of interest, and then add a certain amount of 
an inducer of the lac system, for instance IPTG 

15 ( isopropyl-fi-D-thiogalactoside) , or use a temperature 
sensitive system at the correct temperature. These possi- 
bilities are often not practical for large scale indus- 
trial applications . The alternative is to use a promoter 
that has exactly the right strength. However, such pro- 

20 moters are seldom available, and furthermore one needs a 
range of promoter activities in order to optimize the ex- 
pression of the gene in the first place, see below. 

During the past two decades , much work has been done to 
25 define and optimize the consensus sequences of microor- 
ganisms. In many prokaryotes, one often finds two more or 
less conserved DNA sequences at approximately position - 
10 and -35 relative to the start site for transcription, 
TATAAT and TTGACA, respectively, with aproximately 17 
30 basepaii^ between the two. The dogma in this field is 
that, by including these elements, the resulting promot- 
ers would tend to become strong. Indeed, promoter up mu- 
tations, which are relatively rare events, ususally re- 
sults in a better match to the above consensus sequences, 
35 while down mutations results in a poorer match to the 
consensus sequences or a less optimal distances between 
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these. In addition, when random DNA sequences are cloned 
in place of one of the two consensus sequences, the 
strength of the resulting promoters usually correlate 
with the degree of homology to the consensus sequences . 

5 

In principle, modulation of the strength of promoters 
could then be achieved by basepair changes in the consen- 
sus sequences or by changes in the length of the spacer 
between these. But the impact of such changes on the pro- 
10 moter strength will tend to be large (see example 1 of 
this invention) , and it is therefore not feasible to 
achieve small steps of strength modulation through base- 
pair changes in the consensus sequences. 

15 While the length of the spacer separating the two consen- 
sus sequences is known to play an important role for the 
strength of a promoter, the sequence of the spacers be- 
tween the consensus sequences has usually been considered 
to be of little importance for the strength of the pro- 

20 moters, and attempts to identify additional consensus se- 
quences in the spacer region through mutagenesis have in- 
deed been unsuccesful. So far, nobody has attempted to 
randomize the spacer, while keeping the consensus se- 
quences and the spacer length relatively constant. 

25 

Numerous experiments have been carried out in order to 
define and optimize the consensus sequences of micro- 
organisms, including experiments where at least one of 
the consensus sequences was being randomized. In some of 

30 these experiments a part of the nucleotides surrounding 
the consensus sequences was also randomized in order to 
allow for the generation of promoters with length of 
spacers different from 17 bp, and/or in order to find 
possible new consensus motifs around the consensus se- 

35 quences . The chances that this will generate an efficient 
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promoter is very small and a selection must be applied in 
order to find those rare cases where the homology to the 
consensus sequence is sufficiently high to result in even 
a weak promoter. 

5 

OBJECT OF THE INVENTION 

The promoter libraries that we are aiming at should cover 
the entire range of promoter activities that could become 

10 of interest for engineering of a particular species, 
preferably from the weakest promoters detectable, to the 
strongest promoters possible. Moreover, we aim at cover- 
ing this broad activity range in small steps, say in- 
crease in activity by 50-100% per step in order to be 

15 suitable for the purpose of flux optimization as de- 
scribed above . 

In this invention, we show that the sequence of the spac- 
ers between the consensus sequences are far more impor- 

20 tant than it has previously been appreciated. The spacer 
sequence can have a strong impact on promoter strength, 
when 1) a major part of the spacer sequence is varied si- 
multaneously and in a random manner and 2), at the same 
time, at least half of the consensus sequences are kept 

25 constant. We show that if these two conditions are ful- 
filled, our method can be used to generate promoters that 
cover wide ranges of activities, including very strong 
promoters. The range of promoter activities is covered in 
small steps of activity change which makes these promot- 

30 ers very suitable for metabolic engineering purposes. In 
addition, we show that the method can be used to generate 
promoters for a wide range of organisms and at an unusual 
high frequency. 
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SUMMARY OF THE INVENTION 

The present invention provides an artificial promoter li- 
brary for a selected organism or group of organisms, cora- 
5 prising a mixture of double stranded DNA fragments the 
sense strands of which comprise at least two consensus 
sequences of efficient promoters from said organism or 
group of organisms, or parts thereof comprising at least 
half of each, and surrounding or intermediate nucleotide 
10 sequences (spacers) of variable length in which at least 
7 nucleotides are selected randomly among the nucleobases 
A, T, C and G, with the proviso that previously known 
promoter sequences and promoter sequences isolated from 
natural sources are not comprised, 

15 

The broadest variation in promoter strengths is obtained 
when at least 10, preferably at least 12, and more pref- 
erably at least 14, nucleotides in the spacer sequence(s) 
are selected randomly among the nucleobases A, T, C and 
20 G. 

The sense strands of the double stranded DNA fragments 
may also include a regulatory DNA sequence imparting a 
specific regulatory feature to the promoters of the li- 
25 brary. Such specific regulatory feature is preferably ac- 
tivation by a change in the growth conditions, such as a 
change in the pH, osmolarity, temperature or growth 
phase . 

30 For cloning purposes the double stranded DNA fragments 
usually have sequences comprising one or more recognition 
sites for restriction endonucleases added to one of or 
both their ends; most conveniently sequences specifying 
multiple recognition sites for restriction endonucleases 

35 (multiple cloning sites MCS). 
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The selected organism or group of organisms may be se- 
lected from prokaryotes and from eukaryotes , in particu- 
lar from prokaryotes and eukaryotic microorganisms such 
as yeasts, other fungi and cell lines of higher organ- 
5 isms . 

An interesting group of prokaryotes i.a. in the dairy in- 
dustry consists of lactic acid bacteria of the genera 
Lactococcus , Streptococcus , Enterococcus , Lactobacillus 

10 and Leuconostoc, in particular strains of the species 
Lactococcus lactis and Streptococcus thermophilus . Other 
interesting prokaryotes are bacteria belonging to the 
genera Escherichia, Bacillus and Pseudomonas , in particu- 
lar the species Escherichia coli f Bacillus subtilis and 

15 Pseudomonas putida. 

In an artificial promoter library for prokaryotes said 
consensus sequences may for example comprise the -10 sig- 
nal (-12 to -7): TATAAT and at least one activator pro- 
20 tein binding site upstream of the -10 signal or parts 
thereof comprising at least 3 conserved nucleotides of 
each . 

Most often the consensus sequences to be retained in an 
25 artificial promoter library for prokaryotes will comprise 
the -35 signal (-35 to -30): TTGACA and the -10 signal (- 
12 to -7): TATAAT or parts of both comprising at least 3 
conserved nucleotides of each. 

30 More efficient promoters are usually obtained when said 
consensus sequences comprise from 4 to 6 conserved nu- 
cleotides of the -35 signal and from 4 to 6 conserved nu- 
cleotides of the -10 signal, preferably 5 or 6, and more 
preferably all 6 nucleotides of each. The most efficient 

35 promoters are obtained when said consensus sequences fur- 
ther comprise intervening conserved motifs, e.g. selected 
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from the conserved motifs -44 to -41: AGTT , -40 to -36: 
TATTC , -15 to -14: TG , and +1 to +8: GTACTGTT . 

In such promoters the length of the spacer between the - 
35 signal and the -10 signal should be 14-23 bp, prefera- 
bly 16-18 bp, and more preferably 17 bp. This should be 
understood to mean the spacer length between the hexamer 
signals, even when some of the nucleotides in the signals 
have been mutated . 

In eukaryotic organisms said consensus sequences should 
comprise a TATA box and at least one upstream activation 
sequence (UAS) . 

An interesting eukaryotic microorganism is the yeast spe- 
cies Saccharomyces cerevisiae, normal baker's yeast. In 
promoters to be used in Saccharomyces the consensus se- 
quences may further comprise a transcription initiation 
signal (TI box) functioning in Saccharomyces cerevisiae. 



In a specific embodiment /of an artificial promoter li- 
brary according to the i/vention for Saccharomyces cere- 
visiae said consensus Sequences comprise the TATA box: 
TAT AAA, the UAS G CN4 p : /TGACTCA, and the TI box: 
25 CTCTTAAGTGCAAGTGACTG^GA, which also functions as the 
binding site for thfe arginine repressor, argR. 

The individual promoters of the artificial promoter li- 
braries defined above are also comprised by the inven- 

30 tion. Specific promoters which have been constructed ac- 
cording to the following examples are those stated in the 
SEQ IDs Nos. 5 to 58 below. The invention further com- 
prises artificial promoters which are derived from pro- 
moters defined by the artificial promoter libraries of 

35 the invention. 
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The present invention also provides a method of con- 
structing an artificial promoter library for a selected 
organism or group of organisms, which comprises selecting 
at least two consensus sequences of efficient promoters 
5 from said organism or group of organisms; synthesizing a 
mixture of single stranded DNA sequences comprising said 
consensus sequences, or parts thereof comprising at least 
half of each, and surrounding or intermediate nucleotide 
sequences (spacers) of variable length in which at least 
10 7 nucleotides are selected randomly among the nucleobases 
A, T, C and G; and converting the single stranded DNA se- 
quences into double stranded DNA fragments by second 
strand synthesis . 

15 As previously mentioned, the broadest variation is ob- 
tained when at least 10, preferably at least 12, and more 
preferably at least 14, nucleotides in the spacer se- 
quence^) are selected randomly among the nucleobases A, 
T, C and G. 

20 

In order to obtain an artificial promoter library which 
is susceptible to regulation, the single stranded DNA se- 
quences which are synthesized may include a regulatory 
DNA sequence imparting a specific regulatory feature to 
25 the promoters of the library. Such specific regulatory 
feature is preferably activation by a change in the 
growth conditions, such as a change in the pH, osmolar- 
ity, temperature or growth phase. 

30 Also, in order to obtain an artificial promoter library 
suitable for cloning, a sequence specifying one or more 
recognition sites for restriction endonucleases may be 
added to one of or both the ends of the single stranded 
DNA sequences in the synthesis, or a linker comprising 

35 such restriction sites may be ligated to one of or both 
the ends of the double stranded DNA fragments. Most con- 
veniently such sequences specify multiple recognition 
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sites for restriction endonucleases (multiple cloning 
sites MCS) • 

The selected organisms for which artificial promoter li- 
5 braries can be prepared by the method according to the 
invention and the various degenerated sequences to be 
chosen for the promoter libraries of specific organisms 
are the same as discussed above for the artificial pro- 
moter libraries per se. 

10 

With respect to possible uses of the artificial promoter 
libraries described above, the invention further provides 
a method of optimizing the expression of a gene in a mi- 
croorganism, which comprises 

15 

a) selecting a set of promoters covering a range of pro- 
moter activities in relatively small steps of activity 
change from an artificial promoter library according to 
any one of claims 1-26 or constructed by the method ac- 

20 cording to any one of claims 30-55; 

b) cloning said set of promoters into said organism plac- 
ing in each clone said gene under the control of at least 
one promoter from the set; 

25 

c) growing the selected clones and screening them to find 
the one showing optimized flux of product formation. 

This method is preferably used with organisms selected 
30 from the group consisting of prokaryotic and eukaryotic 
microorganisms such as bacteria, yeasts, other fungi and 
mammalian cell lines. 




10a 



In other aspects, the invention pertains to a method of isolating a promoter sequence 
being capable of optimizing the expresssion of a gene in a selected organism, the 
method comprising 

5 

(i) constructing, using the above method of constructing a promoter library, a set of 
promoters covering, with respect to promoter strength, a range of promoter activities, 

(ii) cloning said set of promoters into the selected organism placing in each clone the 
10 gene to be expressed under the control of at least one promoter of the set, 

(iii) cultivating the clones and selecting the clone showing optimized flux of gene 
product formation, and 

15 (iv) isolating said promoter sequence from the clone showing optimized flux of gene 
product formation, 

and a promoter sequence that is capable of optimising the expression of a gene in a 
selected organism, which promoter sequence is obtainable by the above method of 
20 isolating a promoter sequence. 



AMENDED SHEET 
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BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1. A library of artificial promoters for L. lac- 
tis , from example 1. The promoter activities (Miller 
5 units) were assayed from the expression of a reporter 
gene (lacLM) encoding b-galactosidase transcribed from 
the different synthetic promoter clones, on the promoter 
cloning vector pAK80, in strain MG136 3, grown in GM17 me- 
dium supplemented with 2 |j.g/ml erythromycin. The patterns 
10 of the data points indicate which promoter clones dif- 
fered in either the -35 or the -10 consensus sequence, or 
in the length of the spacer between these two consensus 
sequences . 

15 Figure 2. A library of artificial heatshock regulated 
promoters for L. lactis, from example 2. The promoter ac- 
tivities (Miller units) were assayed from the expression 
of a reporter gene (gusA) encoding P-glucuronidase , tran- 
scribed from the different synthetic promoter clones, on 

20 the chromosome in strain LB436. The cells were grown in 
GM17 medium supplemented with 2 |ig/ml erythromycin, and 
at two different temperatures, 30 and 37 °C . The asssay 
for ^-glucuronidase was parallel to the P-galactosidase 
assay (see example 1) except that X-gluc was used as the 

25 substrate. 

Figure 3. The library of artificial promoters from exam- 
ple 1, assayed for activity in E. coli. The promoter ac- 
tivities (Miller units) were assayed from the expression 

30 of a reporter gene (lacLM) encoding P-galactosidase tran- 
scribed from the different synthetic promoter clones, on 
the promoter cloning vector pAK80, in strain BOE270, 
grown in LB medium supplemented with 200 ng/ml erythromy- 
cin. The patterns of the data points indicate which pro- 

35 moter clones differed in either the -35 or the -10 con- 
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sensus sequence, or in the length of the spacer between 
these two consensus sequences . 

Figure 4. A library of artificial promoters for S. cere- 
5 visiae r from example 7. The promoter activities (Miller 
units) were assayed from the expression of a reporter 
gene (lacZ) encoding P-galactosidase transcribed from the 
different synthetic promoter clones, on the promoter 
cloning vector pYLZ-2 in S. cerevisiae strain SG24 (URA3- 
10 52), grown in SD minimal medium supplemented with 2% glu- 
cose. YP24 and YP435 has a 1 bp deletion and a point mu- 
tation, respectively, in the UASgcN4p binding site. 
pTKlOl contains a promoter in which the UAS sequence has 
been deleted and the TATA box is present, 

15 

Figure 5. Regulation of artificial yeast promoters by ex- 
ternal arginine, from example 7. The promoter activities 
(Miller units) were assayed from the expression of a re- 
porter gene (lacZ) encoding P-galactosidase transcribed 

20 from the different synthetic promoter clones, on the pro- 
moter cloning vector pYLZ-2 in S. cerevislae strain SG24 
(URA3-52), grown in SD minimal medium supplemented with 
2% glucose, with (SDG) or without 1 g/1 arginine 
(SDG+arg). YP183 has a 13 bp deletion in the binding site 

25 for the argR repressor. 

Figure 6, Regulation of artificial yeast promoters by ex- 
ternal amino acids, from example 7. The promoter activi- 
ties (Miller units) were assayed from the expression of a 
30 reporter gene (lacZ) encoding P-galactosidase transcribed 
from the different synthetic promoter clones, on the pro- 
moter cloning vector pYLZ-2 in 5. cerevisiae strain SG24 
(URA3-52), grown in SD minimal medium supplemented with 
2% glucose (SDG) or complex medium (containing amino ac- 
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ids) without uracil (SC-ura) . YP24 has a 1 bp deletion in 
the UAScCN4p binding site. 

DETAILED DESCRIPTION OF THE INVENTION 

5 

In our method, degenerated oligonucleotides are synthe- 
sized for the organism or group of organisms for which a 
promoter library should be constructed. The sequences of 
the oligonucleotides are written by combining the avail- 

10 able knowledge from the literature, on the features that 
makes a promoter function efficiently in that particular 
organism. The amount of information that needs to be 
fixed in the oligonucleotide is somewhat variable among 
different organisms. In E. coll for instance, promoters 

15 of considerable strength may be formed by less perfect 
matches in the -35 and -10 consensus sequences and by 
spacing between these sequences deviating from 17bp 
(TTGACA and TATAAT respectively), whereas the require- 
ments for strong promoters in £. lactls appear to be more 

20 strict. 

Secondly, the single stranded oligonucleotides are con- 
verted into double stranded DNA fragments and cloned into 
a promoter probing vector. The promoter-containing clones 

25 are identified e.g. by their ability to give colonies 
with various extents of colour on indicator plates. This 
should in principle give us only very strong promoters, 
but we discovered that by allowing the spacer sequences 
between the consensus sequences to vary in a random man- 

30 ner, the strength of the resulting promoters are modu- 
lated. In fact, using this method we obtained promoter 
libraries, spanning the entire range of promoter activi- 
ties that is likely to become of interest, in small steps 
of activity change, 

35 
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Optimization of gene expression could then be achieved as 
follows: 1) From the promoter library one chooses promot- 
ers that have e.g. 25%, 50%, 100%, 200% and 400% of the 
strength of the wild-type promoter. 2) Then, these pro- 
5 moters are cloned in place of the wild-type promoter up- 
stream of the gene of interest. 3) The magnitude of the 
variable to be optimized (e.g. the flux through a path- 
way) that is obtained with each of these five constructs 
is then measured and the optimal construct is used di- 

10 rectly as the production organism. It may be necessary to 
fine-tune the expression further or to expand the range 
of promoter activities. A direct advantage of this system 
over the inducible systems described above is that once 
the optimal promoter activity has been determined, the 

15 strain is in principle ready for use in the fermentation 
process . 

In one preferred embodiment the random spacers method of 
the invention is used for generating a series of consti- 

20 tutive promoters for the Gram-positive bacterium, Lacto- 
coccus lactls. In other preferred embodiments we show 
that promoters generated by the random spacers method of 
the invention are functional in at least two species of 
Gram-negative bacteria, Escherichia coll and Pseudomonas , 

25 as well as in the Gram-positive bacterium Bacillus sub- 
tills. We also show that the strength of the individual 
promoter is dependent on which organism it is being used 
in, i.e. that in some organisms a particular promoter is 
strong, in others it is weak, but in all the organisms 

30 tested, the promoters cover a broad range of activities, 
in small steps of activity change. 

Often it is desirable to activate gene expression to a 
certain extent and at a certain stage of a fermentation, 
35 e.g. because the gene product that is expressed inhibits 
the growth of the cells. It is then useful to combine the 
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above technique for obtaining promoters of different 
strength with some regulatory mechanism, e.g. so that the 
promoter will be activated by a change in the pH, tem- 
perature or growth phase. 

5 

Thus, in another preferred embodiment the random spacers 
method of the invention is used for generating a series 
of specifically regulated promoters. As illustrated in 
Example 2, the above approach is used in combination with 
10 specific regulatory DNA sequences to generate a library 
of heatshock-regulated promoters for the Gram-positive 
bacterium, Lactococcus lactis. 

In addition to prokaryotes , eukaryotic microorganisms 
15 (yeast and other fungi as well as mammalian cell lines,) 
are important microorganisms for production of a range of 
organic compounds and various proteins. It is therefore 
of interest to develop the above approach for modulating 
gene expression in these organisms as well. Thus, in yet 
20 another preferred embodiment, as illustrated in Example 
7, the random spacers method of the invention is used for 
generating a series of promoters for the bakers yeast, 
Saccharomyces cerevlsiae. The promoters are here equipped 
with GCN4p and ARGR regulation. 

25 

The regulation of transcription initiation in the eukary- 
otic cell is somewhat more complex compared to the pro- 
karyote. The transcription start site is normally pre- 
ceded by a so-called TATA box that contains the consensus 

30 sequence TATAAA or parts hereof, but unlike in the pro- 
karyote, the distance from the TATA box to the transcrip- 
tion start site is much less defined. In Saccharomyces 
cerevlsiae this distance is typically 40-120 nucleotides 
(Oliver and Warmington, 1989). The so-called -35 consen- 

35 sus hexamer which is found in many prokaryotic promoters 
is absent in Saccharomyces cerevlsiae. Instead so-called 
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upstream activation sequences (UAS) are found upstream of 
the transcription initiation site. These UAS are recog- 
nised by specific DNA binding proteins that can then act 
as activators of transcription initiation. For instance, 
5 the UAS sequence that is found upstream of the genes in- 
volved in aminoacid biosynthesis, UASQCN4p/ consists of a 
DNA sequence that specifies a binding site for the GCN4 
protein, which activates the transcription of these genes 
(Hinnebusch, 1992). In contrast to prokaryotes, where the 

10 distance between the promoter elements appears to be 
critical for the strength of promoters (see example 1), 
the distance between the TATA box and the UAS sequence in 
eukaryotic promoters is highly variable and may be up to 
about 1000 bp. Some genes even contain more than one copy 

15 of the UAS, but one seems to be sufficient for activa- 
tion. 

One of the UAS sequences known from yeast is the binding 
site for the GCN4 protein. Promoters that contain the 

20 binding site for GCN4 protein should be regulated by the 
status of amino acid supply in the growth medium: in the 
absence of amino acids, the GCN4 protein is formed and 
binds specific UAS sequences to stimulate transcription 
at promoters involved in biosynthesis of aminoacids. The 

25 consensus sequence for the GCN4 protein binding site 
(UASGCN4p) is a short inverted repeat, TGACTCA. 

A promoter in Saccharomyces cerevislae that is activated 
by the GCN4 protein is the ARG8 promoter. In this pro- 

30 moter, there is only one copy of the UASccN4p sequence, 
and it is located 59 bp from the TATA box (we refer to 
this distance as spacer 1). Transcription initiation 
takes place some 40-60 nucleotides downstream of the TATA 
box. The ARG8 promoter also contains a DNA sequence that 

35 functions as the binding site for the arginine repressor, 
argR (Crabeel et al . , 1995), which makes the promoter 
four fold repressible by external arginine. 
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Thus, in this case the promoter is located within 136 bp, 
and it contains two regulatory features, which makes the 
system attractive for developing a promoter library by 
the random spacers method of the invention as outlined in 
5 the previous examples. But the method is not limited to 
this model system; in principle, any combinations of TATA 
boxes, UAS sequences, repressor binding sites etc., sepa- 
rated by spacers smaller than about 1000 nucleotides, 
should be suitable as a starting point for this method. 

10 

EXAMPLE 1 

Design of a degenerated oligonucleotide for a L. lactis 
promoter 1 ibrary . 

15 

According to the literature (see review in de Vos & Si- 
mons, 1994), strong promoters in L . lactis tend to have 
the following nucleotide sequences in common (numbers re- 
fer to the position relative to the transcription initia- 

20 tion site, which is given number +1): -12 to -7: TATAAT; 
-15 to -14: TG; -35 to -30: TTGACA. The spacing between - 
10 and -35 seems to be 17 nucleotides. However, closer 
comparison of the promoter sequences that have been pub- 
lished for L. lactis reveals that in a number of posi- 

25 tions besides the ones mentioned above, nucleotides are 
more or less well conserved. Some of these positions are: 
-1: A; -3: A or T (=W); -6: A; -13: A or G (=R); -40 to - 
36: TATTC. In addition, Nilsson and Johansen (1994, BBA) 
pointed out two motives, +1 to +8: GTACTGTT, and -44 to - 

30 41: AGTT, that appear to be well conserved between rela- 
tively strong promoters (promoters for transfer RNA and 
ribosomal RNA operons ) from L. lactis. These motives may 
confer both strength and growth rate dependent expression 
from the promoter. 

When these additional motives are included, one arrives 
at the following 53 nucleobase degenerated sequence for 
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an efficient promoter in L . lactls. Out of these 53 nu- 
cleobases, 34 basyes are conserved, two are semi-conserved 
(R and W) and IT are allowed to vary randomly between the 
four nucleobaaes . 

5 9 AGTTTATTC TTGACAN NNNNNNNNNNNNNTGR TATAATA NNWN AGTACTGTT 3' 




In addition, this degenerated sequence is flanked by se- 
quences that specify multiple recognition sites for re- 
10 striction endonucleases (multiple cloning site MCS), and 
the actual oligonucleotide mixture to be synthesized has 
the following degenerated sequence reported in SEQ ID No. 
1: 



15 

Mbol 
Dpnl 
Alwl 
NlalV 
20 BstYI 



25 



35 



BamHI Msel 

Alwl Aflll Sspl Nsil 
1 ..... 
' CGGGATCCTTAAGAATATTATGCATNNNNNAGTTTATTCTTGACANNNNNNNNNNNNNNT 



Alul 
PvuII 
NspBII 
Sf cl 

30 Msel Fnu4HI 

Rsal Hpal PstI 
Seal Hindi Bbvl EcoRI 
61 100 
GRTATAATANNWNAGTACTGTTAACTGCAGCTGAATTCGG 3 1 



A mixture of oligonucleotides according to this specifi- 
cation was synthesized by Hobolth DNA synthesis. 



This oligonucleotide mixture is single stranded initially 
40 and must be converted into double stranded DNA fragments 
for the purpose of cloning. This was done by synthesizing 
in vitro a 10 bp oligonucleotide, having a sequence com- 
plementary to the 3' end of the promoter oligonucleotide. 
This oligonucleotide was then used as a primer for second 
45 strand synthesis by the Klenow fragment of DNA polymerase 
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I in the presence of dATP, dCTP, dGTP and dTTP . In this 
way the second DNA strand became exactly complementary to 
the first DNA strand. 

5 The result is a mixture of 100 bp double stranded DNA 
fragments containing multiple recognition sites for re- 
striction endonucleases in both the 3' and 5' end. These 
DNA fragments were then cut with restriction endonucle- 
ases in order to create suitable "sticky" ends compatible 

10 with the ends of the vector DNA fragment, pAK80 
(Israelsen et al . , 1995). pAK80 is a shuttle vector, 
meaning that it has replication origins for propagation 
in both E. coll and L . lactls . In this way, the cloning 
steps can be conveniently performed in E. coll, while the 

15 subsequent physiological experiments can be done in L. 
lactls. Furthermore, pAK80 carries a promoterless (3- 
galactosidase reporter gene system (lacLM) downstream a 
multiple cloning site. Thus, pAK80 does not express the 
lacLM genes, unless a promoter is inserted in the multi- 

20 pie cloning site. 

Two cloning strategies were used for cloning the mixture 
of double stranded DNA fragments into the cloning vector 
pAK80: 

25 

1) The mixture was digested with BamHI and PstI and the 
vector pAK80 with PstI and Bglll (Bglll is compatible 
with BamHI) . 

30 2) The mixture was digested with Sspl and Hindi and the 
vector pAK80 with Smal (all three enzymes produce blunt 
end DNA fragments). 

In both cases the vector DNA was further treated with 
35 Calf Intestine Phosphatase (CIP) to prevent religation of 
the cloning vector. Subsequently, the fragment mixture 
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and vector DNA were ligated overnight at 16 °C using T4 
DNA ligase and standard ligation conditions. 

Ligation mixtures were transformed into E. coll K-12 
5 Alac, with selection for erythromycin resistance. Cells 
of the E. coll K-12 strain, MT102, were made competent 
using standard treatment with Ca ++ ions (Maniatis et al . , 
1982). Ligation mixtures were then transformed into these 
cells using a standard transformation procedure (Maniatis 

10 et al., 1982), and the resulting clones were screened for 
(3-galactosidase activity that will produce blue colonies 
on plates containing X-gal ( 5-bromo-4-chloro-3-indolyl-J3- 
D-galactoside) . The transformation mixture was plated on 
LB plates containing 200 \xg/ml erythromycin, 1% glycerol 

15 and 100 \xg/ml X-gal. 150 erythromycin resistant transfer- 
mants were obtained, all white initially, but after pro- 
longed incubation (two weeks at 4 °C) , 46 of these colo- 
nies had become light blue. Thus, using strategy 1) we 
found 17 blue colonies (CP30 to CP46), and using strategy 

20 2) we found 29 blue colonies (CP1 to CP29). 

Plasmid DNA was isolated from each of these clones (CP1 
to CP46) and analysed by restriction enzyme mapping. 
Nearly all plasmids (except CP31 and CP43) contained pro- 
25 moter fragments inserted into the MCS of pAK80, in the 
orientation that would direct transcription of the other- 
wise promoterless lacLM genes on this vector. 

These 4 6 plasmid DNA preparations were then transformed 
30 into L. lactls subspecies lactls MG1363 with selection 
for erythromycin resistance. Cells of the L. lactls sub- 
species lactls strain, MG1363 (Gasson, 1983) were made 
competent by growing the cells overnight in SGM17 medium, 
containing 2% glycine, as described by Holo and Nes 
35 ( 1989). Plasmid DNA from each of the 46 clones described 
above was then transformed into these cells using the 
electroporation procedure (Holo and Nes, 1989). After re- 



WO 98/07846 




PCT/DK97/00342 



21 



generation, the cells were plated on SR plates containing 
2 |ig/ml erythromycin- Subsequent screening for blue color 
on X-gal plates revealed large differences in p- 
galactosidase activity between the 46 clones; some clones 
5 gave dark blue colonies after 24 hours of incubation, 
others only light blue colonies after more than 1 week of 
incubation. 

The P-galactosidase activities of liquid cultures of the 
10 4 6 clones in MG136 3 were then determined as described by 
Miller (1972) and modified by Israelsen et al . ( 1995). 
Cultures of the strain MG1363, each carrying one of the 
46 plasmid derivatives of pAK8 0, were grown in Ml 7 medium 
supplemented with 1% glucose overnight at 30 °C . 25-100 
15 \il of these cultures were then used in the subsequent P- 
galactosidase assay, except in the case of the weakest 
promoter clones, where 2 ml of culture was used (after 2 0 
fold concentration by centrif ugation) . These results are 
shown in Figure 1. Apparently, there are very large dif- 
20 ferences in the efficiency of the cloned promoter frag- 
ments, and together these clones cover a range of pro- 
moter activities from 0.3 units of P-galactosidase activ- 
ity to more than 2000 units, which is probably the 
strongest promoters known for this organism 

25 

In addition, the broad range is covered by small changes 
in activity and, therefore, these promoter fragments will 
allow us not only to obtain a wide range of expression of 
genes in L. lactis, but also to tune the expression of 
30 genes in L. lactis in small steps for the purpose of flux 
optimization . 

DNA sequencing of the 46 clones described above revealed 
that most of the inserted promoter fragments had the DNA 
35 sequence that was originally specified for the oligonu- 
cleotide design (see above), whereas the sequence of the 
remaining fragments deviated slightly from that sequence. 
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Most of the promoter fragments that gave the lower ac- 
tivities in the p-galactosidase assay (70 units of p- 
galactosidase or less) had either an error in one of the 
consensus sequences or a 16 bp spacer between the consen- 
5 sus sequences. This result is in accordance with the 
dogma , i.e. that changes in the consensus sequences 
have strong effects on the activity of a given promoter, 
and emphasizes the fact that a more subtle approach is 
needed in order to generate a promoter library that cov- 

10 ers a range of activities in small steps of activity 
change. Clearly, if we would have allowed only changes in 
the consensus sequences and/or changes in the length of 
the spacer, instead of allowing the sequence of the spac- 
ers to vary randomly, only fairly weak promoter clones 

15 would have resulted, and the resulting library would not 
be suitable for metabolic engineering. 

In general, the clones that gave high activities (more 
than 70 units) had the same sequence as specified by the 

20 oligonucleotide. In total, the activity of the clones 
that had intact consensus sequences and 17 basepair 
spacer length, spanned activities from 5 units (CP4) to 
2000 units (CP25). These results therefore show that at- 
least a 400 fold change in promoter activity can be ob- 

25 tained by randomizing the spacer while the consensus se- 
quences are kept constant. 

Usually, for metabolic engineering purposes, relatively 
strong promoters are desired. However, there may also be 

30 cases where rather weak promoters are needed. The rela- 
tively few errors that had occurred during synthesis of 
the above oligonucleotide mixture, were not intended to 
be present in the promoter fragments; and our data then 
suggest that it may be useful to generate, deliberately, 

35 a mixture of oligonucleotides which have a low percentage 
of errors in the consensus sequences . 
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Enzymes used in the various enzymatic reactions above 
were obtained from and used as recommended by Pharmacia 
and Boehringer. 

5 EXAMPLE 2 

Design of a degenerated oligonucleotide for a library of 
temperature regulated I*. lactis promoters. 

10 This example illustrates the development of a temperature 
regulated promoter library for L. lactis. A regulatory 
element comprising an eiglrt basepair inverted repeat that 
has been shown to be involved in the heatshock response 
of L. lactis is inserted a few base pairs upstream of the 
15 -35 sequence. The minimal extent of such a regulatory 
element seems to be y2 7 basepairs : 

5 ' -TTAGCACTCNNNNNNNNNGAGTGCTAA- 3 ' 

IR spacer IR 

20 

containing a 9 bp (or longer) inverted repeat (IR) sepa- 
rated by 9 (or fewer) basepairs. It should therefore be 
possible to combine this inverted repeat with the ap- 
proach for obtaining constitutive promoters of different 
25 strength and thus obtain a series of promoters with vari- 
ous basal activities which can be induced several fold by 
changing the temperature of the culture medium. 

Therefore, an oligonucleotide was designed, which in- 
30 eludes the core part (from position -35 to +6) of the se- 
quence from the constitutive promoter series above (see 
example 1 and SEQ ID No. 1). The sequence upstream of the 
-35 hexamer has been replaced by the above inverted re- 
peat sequence, and the sequence downstream of position +6 
35 has also been modified, eliminating two conserved regions 
compared to example 1 (+1 to +8: GTACTGTT and -44 to -41: 
AGTT, which have been implicated in growth rate regula- 
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tion, but which turned out to be dispensable with respect 
to creating strong promoters for L . lactis, see example 
1). The sequence of the spacer (sp.l) between the two in- 
verted DNA sequences in the inverted repeat was here al- 
5 lowed to vary randomly in order to see whether this had 
any effect on the temperature regulation of the resulting 
promoters, e.g. how many fold they could be induced by 
changing the temperature. The importance of the spacing 
(sp.2) between the inverted repeat and the -35 hexamer is 
10 not known, but in principle this region may contribute to 
or modulate the heatshock response of promoters. In order 
to limit the number of parameters, however, we have cho- 
sen here to include a naturally occuring configuration 
(derived from the dnaJ promoter from L. lactis; van As- 
15 seldonk et al., 1993): a short spacer sequence consisting 
of 5 times T. 

When these sequences arje combined, one arrives at the 
following 7 3 bp consensus sequence for a temperature 
0 regulated promoter in/li. lactis. Out of these 7 3 bp, 45 
are conserved, two aire semi-conserved (R and W) and 26 
are allowed to vaiy randomly between the four nucleo- 
bases . -/ 

25 5 ' TTAGCACTCNNNNNNNNNGAGTGCTAATTTTT TTGACAN NNNNNNNNNNNNNTGR 
IR spacer 1 IR sp.2 spacer 3 

TATAATANNWN AGTAC TG 3' 

30 In addition, this degenerated sequence was flanked by se- 
quences that specify multiple recognition sites for re- 
striction endonucleases (multiple cloning sites MCS), and 
the actual oligonucleotide mixture that is being synthe- 
sized has the following degenerated sequence reported in 

35 SEQ ID No. 2: 
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Mbol 
Dpnl 
Alwl 
NlalV 

5 BstYI Msel Msel 

BamHI Alul Sspl 
Alwl Hindlll Asel 

1 

5 1 CGGGATCCAAGCTTAATATTAATTAGCACTCNNNNNNNNNGAGTGCTAATTTTTTTGACA 
10 IR IR -35 

Alul 
PvuII 
NspBII 

15 Sfcl EcoRI 

PstI Apol 
Rsal Fnu4HI Mael 
Seal Bbvl Xbal 
61 A . T . . . 113 

20 NNNNNNNNNNNNNNTGGTATAATANNANAGTACTGCAGCTGTCTAGAATTCGG 3 1 

-15 -10 +1 



This oligonucleotide mixture was converted into double 
stranded DNA fragments (DSDF) and cloned into the pro- 

25 moter cloning vector pLB85i. pLB85i has an origin for 
replication in E. coll but not in L. lactls and se- 
lectable markers for both organisms . Instead of an origin 
of replication for L . lactls, it contains the attP se- 
quence which can direct the insertion of the plasmid into 

30 the L. lactls chromosome if the int gene product is sup- 
plied in trans. This example therefore also serves to il- 
lustrate the modulation of chromosomal ly encoded genes, 
using promoters generated through the random spacers 
method. pLB85i is also a promoter cloning vector and con- 

35 tains a multible cloning site upstream of a promoterless 
gusA gene. gusA is similar to the lacZ and lacLM screen- 
ing sytems used in the other examples, except that the 
substrate is X-gluc instead of X-gal . It was chosen as 
the reporter gene for this particular application, where 

40 heatshock regulated promoters are supposed to be ana- 
lysed. This was because the gusA gene product is not heat 
labile, which seems to be a problem connected with the 
lacLM gene used in example 1. 
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The DSDF mixture was here digested with Xbal and Asel and 
the vector pLB85i was digested with Xbal and Ndel (Ndel 
is compatible with Asel) and further treated with alka- 
line phosphatase to remove the 5 phosphate groups from 
5 the vector. After ligation of the DFDS mixture and the 
vector, the ligation mixture was transformed into strain 
KW1, a gusA negative E. coll strain, with selection for 
ampicillin resistance. This resulted in approximately 100 
colonies of which 80% had different blue color intensi- 

10 ties, indicating that these clones were carrying putative 
heat shock promoter fragments, 20 blue clones were picked 
for further analysis. Restriction analysis of plasmid 
preparations from these clones showed that they had in- 
serts of aproximately the right size. Subsequently, 6 of 

15 these plasmid preparations were used to transform LB436 
(a derivative of L. lactls MG1363 which contain a second 
plasmid, pLB81, supplying the necessary int gene for in- 
tegration of the plasmids into the attB site on the L. 
lactls chromosome) . From each of the transformations 

20 colonies were isolated with putative integrations of the 
constructs into the attB attachment site on the L. lactls 
chromosome. The integration of the constructs was con- 
firmed by standard colony PGR, using one primer in the 
attB region and one primer in the pLB85i region. The 

25 clones were subsequently tested for their abillity to 
form blue colonies on GM17 medium supplemented with 
erythromycin and 100 ng/ml X-gluc at 30 °C and 37 °C, re- 
spectively, and the clones showed a clear difference in 
color intensity at the two temperatures, indicating that 

30 the promoter activities were now under temperature regu- 
lation. Next the gusA expression from the heat shock pro- 
moters was measured in liquid cultures at the two tem- 
peratures, for 5 selected heatshock-regulated promoters, 
see Figure 2. The clones had different activities and 

35 covered a broad range of promoter activities at 30 °C, 
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and 4 out of the 5 clones (all except HP 6 ) gave higher 
promoter activities at 37 °C, which shows that the pro- 
moters were indeed temperature regulated. Interestingly, 
the promoters were regulated by almost the same fold, 
5 i.e. from 1.7 to 2.3 fold by the temperature shift from 
30 to 37 °C, which indicates that the spacer (spacer 1) 
in the inverted repeat is of minor importance with re- 
spect to determining the fold of induction of these arti- 
ficial promoters. 

10 

We have here been looking at the accumulated gusA activ- 
ity, and although the data clearly show that the promot- 
ers are temperature regulated, one skilled in the art of 
analysing gene expression will appreciate that this makes 

15 it more difficult to observe the changes in promoter ac- 
tivities, that are brought about for instance by some 
change in external parameter such as temperature. In ad- 
dition, it has been shown that the activity of heatshock 
regulated promoters is temporarily ten fold higher imme- 

20 diately after the temperature shift, than it appears when 
the steady state levels are compared as they are on Fig- 
ure 2 . To observe the change in promoter activity more 
carefully, one should look at the rate of protein or mRNA 
synthesis, before and at various times after the pertur- 

25 bation in temperature. We therefore isolated RNA from the 
5 clones at various times after the change in temperature 
from 30 °C to 37 °C and visualized the gusA mRNA by stan- 
dard Nothern blotting (Maniatis et al . , 1983), to analyze 
how many fold the individual promoter clone was induced 

30 by the temperature change. 

EXAMPLE 3 

The Gram-positive bacterium, Bacillus subtilis is used 
35 extensively as an industrial bioreactor for the produc- 
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tion of a range of heterologous proteins. It was there- 
fore of interest to test whether the random spacers 
method of the invention could also be used to generate 
promoter libraries for this organism. The consensus se- 
5 quences for Bacillus subtilis are very similar to the 
consensus sequences for E. coll and L. lactis, and we 
could therefore test whether the approach was also valid 
for Bacillus subtilis by subcloning a number of CP pro- 
moters into a promoter cloning vector for this Bacillus 

10 subtilis and then ask 1) whether the CP promoters are ac- 
tive in Bacillus subtilis and 2) whether, also in this 
organism, the spacer between the consensus sequences 
plays an important role for the promoter strength. We 
chose to use the promoter cloning vector, pDG268, which 

15 is designed for integration of promoter fusions to lacZ 
into the amy locus on the Bacillus subtilis chromosome. 
The vector confers ampicillin and neomycin resistance, 
and it will replicate in E. coli for the initial cloning 
purposes, but not in Bacillus subtilis . When the line- 

20 arized form of the vector is transformed into Bacillus 
subtilis it will be inserted into the amy locus on the 
Bacillus subtilis chromosome. This example therefore also 
serves as an example of the use of the promoters gener- 
ated through the random spacers methods for modulation of 

25 chromosomally encoded genes. 

The CP promoters were indeed active also in Bacillus sub- 
tills, and the individual strength of the promoters were 
again very different from each other. The fact that the 
30 promoter library also covered a broad range of promoter 
activities in Bacillus subtilis, shows that the random 
spacers method is also valid for this organism. 
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EXAMPLE 4 

The CP promoters of example 1 were designed for use in L. 
lactls, but the consensus sequences of E. coll promoters 
5 are included in the sequence of the oligonucleotide given 
in example 1. Furthermore, the vector used for generating 
the CP promoter library in example 1 is a shuttle vector 
for L. lactls and E. coll, so this also allowed us to 
analyse the activity of the CP promoters in the Gram- 

10 negative bacterial host, E . coll. Figure 3 shows the ac- 
tivity of 33 of the CP promoters in E. coll. Clearly, the 
activity of the individual CP promoters is also here very 
different, and together the promoters cover a broad range 
of activities. Interestingly, the correlation between the 

15 strength of the individual promoters in E. coll and L. 
lactls was not very strong: some promoters that were 
found to be strong promoters in i. lactls were found to 
be weak in E. coll and vice versa. 

20 The activity of the promoters, in terms of 0- 
galactosidase activity, was generally much lower than the 
activity found in L. lactls. This was probably due to the 
fact that the promoter cloning vector, pAK80 is designed 
and optimised for use in the Gram-positive bacterium, L. 

25 lactls, and the translation efficiency in E . coll could 
therefore be low. We therefore sub-cloned three of the CP 
promoters (CP20, CP22 and CP25) into the promoter cloning 
vector, pCB267, which was designed for cloning E. coll 
promoters in upstream of a promoterless lacZ gene, encod- 

30 ing P-galactosidase . In this promoter system, the CP pro- 
moters turned out to be very efficient promoters, but the 
relative difference in strength between the three promot- 
ers was conserved. Thus, the CP25 promoter gave 2.5 fold 
higher activity than the hybrid promoter, ptac, which is 

35 considered to be among the strongest promoters known for 
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use in E. coll. These data therefore further demonstrate 
the strength of our approach: by analysing a relatively 
small number of promoter clones obtained through the ran- 
dom spacers method of the invention, we have managed to 
5 arrive at some of the strongest promoters in both E. coll 
and L. lactis. 

EXAMPLE 5 

10 Bacteria belonging to the Gram-negative species, Pseudo- 
monas , are becoming increasingly important due to their 
application in e.g. biodegradation of chemical waste 
products in polluted soil. But also here the genetic en- 
gineering is hampered by the lack of suitable promoters 

15 and expression systems. The literature on Pseudomonas 
promoters revealed that the consensus sequences for Pseu- 
domonas are somewhat less well defined than those of E. 
coll, L. lactis and Bacillus subtllls . Thus, in the -35 
region one often finds TTGR conserved (R=A or G) whereas 

20 the rest of the -35 consensus sequence is varying between 
different promoters. The -10 consensus sequence is proba- 
bly TATRAT. The spacing between the TTGR motive and the 
-10 sequence is 18-19 bp, which is equivalent of the 16- 
17 bp spacer often found in E. coll. It follows that the 

25 consensus sequences for vegetative promoters in this or- 
ganism are quite close to the consensus sequences for E. 
coll and L. lactis. 

We therefore tested a range of the CP promoters from ex- 
30 ample 1 in Pseudomonas putida by cloning the promoters 
into a cloning vector that contains a promoterless (3- 
galactosidase gene. Again the activity of the CP promot- 
ers differed in strength over a broad range of promoter 
activities. These results show that also in this species, 
35 the random spacers method of the invention could be used 
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to generate both relatively strong promoters and also a 
broad range of promoter activities covered in small steps 
of activity change 

5 EXAMPLE 6 

We also designed an oligonucleotide based on the consen- 
sus sequences stated in example 5 and incorporated multi- 
ple cloning sites as described in example 1. The follow- 
10 ing oligonucleotide was synthesised. 



The oligonucleotide was converted to double stranded DNA, 
15 using a primer homologous to the 3' end of the oligonu- 
cleotide and cloned upstream of a promoterless p- 
galactosidase gene on a promoter probe vector. The liga- 
tion mixture was transformed directly into a Pseudomonas 
putida strain with selection for the antibiotics resis- 
20 tance carried by the plasmid and on plates containing X- 
gal • This resulted in approximately 100 clones of various 
blue color intensities. Subsequently, 30 clones were ana- 
lysed for P-galactosidase activity as described above. 
These results showed that also in this species, the ran- 
25 dom spacers method of the invention could be used to gen- 
erate both relatively strong promoters and also a broad 
range of promoter activities covered in small steps of 
activity change. 

30 EXAMPLE 7 

Design of a degenerated oligonucleotide for a Saccharomy- 
cgs cexrevlslae promoter library. 

35 A 199 bp oligonucleotide was designed, which includes, 
starting from the 5' end: an EcoRI restriction site (for 



MCS-(N) 8-TTGR-Ni9-TATRAT-(N) 8-MCS 
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use in the subsequent cloning strategy, see below) , the 
consensus UASccN4p/ a 59 bp spacer (spacer 1), the con- 
sensus TATAAA sequence (TATA box), a 38 bp spacer (spacer 
2), the 23 bp repressor binding sequence (which is also 
5 the region in which transcription is initiated) , the 
spacer sequence between the TI box and the first codon of 
the ARG8 gene (this spacer is here the native sequence 
from the ARG8 gene including also the first codon of the 
ARG8 gene, ATG) , and, finally, a BamHI site was included, 
10 which is designed to give an in frame fusion with the P- 
galactosidase reporter gene, see below. 

The sequences of spacer 1 and spacer 2 were allowed to 
vary randomly, and the actual oligonucleotide mixture be- 
15 ing synthesized had the following degenerated sequence 
reported in SEQ ID No* 3: 

Plel 
EcoRI Hinfl 
20 Apol Maelll 

1 ..... 
5 * CAGAATTCGTGACTCANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 

UASGCK4P 

25 61 

NNNNNNNNNNNNNNNNTATAAANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 
TATA-box 

Msel Tfil 
30 Aflll Maelll Hinfl 

121 ...... 

N CTCTTAAGTGCAAGTGACTGCGA ACATTTTTTTCGTTTGTTAGAATAATTCAAGAATCG 
TI box 

35 Mbol 

Dpnl 
Alwl 
NlalV 
BstYI 

40 BamHI 

Alwl 
Nlalll 
181 . 199 

CTAC CAAT C ATGGAT C C C G 3' 
45 ARG8 atart 



This oligonucleotide mixture was converted into double 
stranded DNA fragments (DSDF), using an oligonucleotide 
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complementary to the last 23 bp of the 3' end of the 199 
bp degenerated oligonucleotide as described in example 1. 
Subsequently, it was cloned into either of the two pro- 
moter cloning vectors, pYLZ-2 and pYLZ-6 (Hermann et al . , 
5 1992), as follows: the DSDF mixture and the vector were 
both double-digested with EcoRI and BamHI , and the DSDF 
were ligated to the vector DNA. The ligation mixture was 
transformed into J5\ coli as described in example 1, with 
selection for ampicillin resistance. Plasmid DNA was iso- 
10 lated from 500 individual clones and screened for the 
presence of putative promoter fragments by digestion with 
the restriction enzymes, EcoRI and BamHI. 

17 clones were found to have an EcoRI - BamHI insert of 
15 approximately 2 00 bp. Plasmid DNA from these 17 clones 
were transformed into S. cerevisiae, with selection for 
the URA3 marker carried by the plasmids and assayed for 
production of P-galactosidase . Figure 4 shows the result- 
ing activities of P-galactosidase for the 17 clones. All 
20 the promoters have higher activities than the cloning 
vector itself without promoter fragment inserted (pYLZ- 
2). More important however, also in this case the clones 
cover a range of promoter activities in small steps of 
activity change. 

25 

Sequence analysis revealed that the 17 clones discussed 
above had a perfect TATA box ( TAT AAA ) between the 
spacer 1 and spacer 2, while two of the 17 clones, YP24 
and YP435 each had a defect in the UASQCN4p- However, the 
30 activity of YP435 was 39 units which is close to the 
value obtained with YP212. These data then suggest that 
the impact of the random sequence of the spacers on the 
promoter strength, is stronger than the impact of the 
state of the UASQCN4p binding site. 



35 
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As discussed above, the artificial yeast promoters had 
built in two different regulatory features. One is that 
the promoters should be regulated by the presence of ar- 
ginine in the growth medium. To test whether the art if i- 
5 cial yeast promoters were also regulated by arginine, we 
grew a number of clones in minimal medium, with and with- 
out arginine (SD + 2% glucose; SD + 2% glucose + 1 g/1 
arginine) . Figure 5 shows the result of these experi- 
ments . The clones, YP18 r YP212, YP435 were indeed regu- 
10 lated 5, 8, 15 fold, respectively, by the presence of ar- 
ginine. YP183 was not regulated by arginine, and sequence 
analysis confirmed that this promoter clone had a 13 bp 
deletion in the arginine repressor binding site. 

15 We also tested whether the promoters were regulated by 
external amino acids in the growth medium, by analyzing 
the promoter activity of some of the yeast promoter 
clones in minimal medium (SD + 2% glucose) and rich me- 
dium (SC + 2% glucose-URA) . Figure 6 shows the result of 

20 these experiments. Indeed, the promoters present in 
clones YP18,YP212, YP435, and YP183 were regulated from 2 
to 10 fold, by the presence of amino acids. YP24 was not 
regulated, in accordance with the error that had occured 
in the UASQCN4p site on this clone, see above. 



The results on amino acid and specific arginine regula- 
tion demonstrate that the random spacers method of the 
invention can be used for generating promoters which 
cover a broad range of promoter activities and which can 
30 be regulated by external signals . The regulatory aspect 
of the invention is here examplified by the amino acid 
and arginine regulation, but is not limited to these 
cases . 
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